Multi-armed bandit

Results: 113



#Item
81Analysis of algorithms / Estimation theory / Normal distribution / M-estimator / Time complexity / Asymptotically optimal algorithm / Algorithm / Multi-armed bandit / Statistics / Theoretical computer science / Applied mathematics

Efficient Regret Bounds for Online Bid Optimisation in Budget-Limited Sponsored Search Auctions Long Tran-Thanh1 , Lampros Stavrogiannis1 , Victor Naroditskiy1 Valentin Robu1 , Nicholas R Jennings1 and Peter Key2 1: Univ

Add to Reading List

Source URL: research.microsoft.com

Language: English - Date: 2014-06-20 12:31:45
82Machine learning / Submodular set function / Operations research / Theoretical computer science / Natural language processing / Mathematical optimization / Multi-armed bandit / Algorithm / Linear programming / Statistics / Applied mathematics / Mathematics

Linear Submodular Bandits and their Application to Diversified Retrieval Carlos Guestrin Machine Learning Department Carnegie Mellon University

Add to Reading List

Source URL: www.select.cs.cmu.edu

Language: English - Date: 2011-10-28 13:54:18
83Markov processes / Dynamic programming / Markov decision process / Stochastic control / Distribution / Multi-armed bandit / Statistics / Mathematical analysis / Generalized functions

Selecting the State-Representation in Reinforcement Learning Odalric-Ambrym Maillard INRIA Lille - Nord Europe [removed]

Add to Reading List

Source URL: eprints.pascal-network.org

Language: English - Date: 2011-11-02 05:20:38
84Number theory / Machine learning / Multi-armed bandit / Stochastic optimization / Normal distribution / Valuation / Factorial / Mathematics / Statistics / Mathematical analysis

Beat the Mean Bandit Yisong Yue H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, USA Thorsten Joachims Department of Computer Science, Cornell University, Ithaca, NY, USA

Add to Reading List

Source URL: www.yisongyue.com

Language: English - Date: 2011-05-11 16:32:53
85Machine learning / Submodular set function / Operations research / Theoretical computer science / Natural language processing / Mathematical optimization / Multi-armed bandit / Algorithm / Linear programming / Statistics / Applied mathematics / Mathematics

Linear Submodular Bandits and their Application to Diversified Retrieval Carlos Guestrin Machine Learning Department Carnegie Mellon University

Add to Reading List

Source URL: www.yisongyue.com

Language: English - Date: 2011-10-28 13:51:45
86Applied mathematics / Asymptotic analysis / Big O notation / Mathematical notation / Asymptotically optimal algorithm / Algorithm / Multi-armed bandit / Analysis of algorithms / Mathematics / Statistics

Latent Bandits. Odalric-Ambrym Maillard ODALRIC - AMBRYM . MAILLARD @ ENS - CACHAN . ORG The Technion, Faculty of Electrical Engineering[removed]Haifa, ISRAEL Shie Mannor The Technion, Faculty of Electrical Engineering 320

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2014-02-16 19:30:21
87Econometrics / Statistical inference / Machine learning / Confidence interval / Multi-armed bandit / Thompson sampling / Reinforcement learning / Bayes estimator / Dimensional analysis / Statistics / Measurement / Estimation theory

Thompson Sampling for Complex Online Problems

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2014-02-16 19:30:21
88Gittins index / Probability theory / Multi-armed bandit / Risk-neutral measure / Mechanism design / Statistics / Decision theory / Design of experiments

Incentivizing Exploration PETER FRAZIER, Cornell University, Ithaca NY DAVID KEMPE, University of Southern California, Los Angeles CA JON KLEINBERG, Cornell University, Ithaca NY ROBERT KLEINBERG, Cornell University, Ith

Add to Reading List

Source URL: www.cs.cornell.edu

Language: English - Date: 2014-05-07 00:31:23
89Machine learning / Cybernetics / Theoretical computer science / Multi-armed bandit / Stochastic optimization / Reinforcement learning / Greedy algorithm / Recommender system / Algorithm / Statistics / Mathematics / Applied mathematics

WWW 2010 • Full Paper April 26-30 • Raleigh • NC • USA A Contextual-Bandit Approach to Personalized News Article Recommendation

Add to Reading List

Source URL: www.research.rutgers.edu

Language: English - Date: 2010-05-02 03:42:39
90Markov processes / Stochastic control / Robot control / Reinforcement learning / Q-learning / Markov decision process / Kalman filter / Multi-armed bandit / Machine learning / Statistics / Markov models / Dynamic programming

All learning is local: Multi-agent learning in global reward games Yu-Han Chang MIT CSAIL Cambridge, MA 02139

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:52
UPDATE